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This document describes the implementation of the Modular 
programming System. It is intended for use by the implementors 
of MPS, and contains the following sections: , PBS; .LBSsl; 

The allocation of registers in the MPL run-time environment: . 
(registers) 

The layout of the fixed part of a dseg: (dseg) 

The layout of the system dseg transfer vector and the global 
fixed tables used in MPS: (sysdseg) 

The machinery which supports function call/return, both 
compiler generated code and the run-time support code: 
(functions) 

The machinery which supports ports and port calls (portcalls) 

The description of MPL object modules, including the code and 
symbol table information blocks: (modules) 

Format of the object code block in an MPL module: (code) 

Structure of the symbol table information in an object module 
as constructed by the MPL compiler: (symbols) 

The implementation of SIGNALS: (signals) 

The temporary string facilities provided for bootstrapping MPS: 
(strings) 

The free storage package used for the HEAP! (fsp) 

An overview of the process by which the MPS is bootstrapped 
into existence initially: (bootstrap) 

(registers) .PBSj.LB3=l; Register Assignments: MPL definitions for 
register assignments can be found in (mps,mpiregs,l:wn) and can 
be iNCLUDii'd. 

o:NULMSG holds the null message for PORT system. 

ljAl first scratch ac 

2;A2 scratch ac 2 

2:RS result stack pointer 

3: A3 scratch ac 3 

li*Ak scratch ac It 

5: AS scratch ac 5 
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6:A6 scratch ac 6 

7:RP record pointer (for field variables) 

10:m stack mark (frame pointer) 

11:S stack pointer 

12 :P pointer to recent PORT 

13.*C code segment 

lli:l) data segment 

15:LB link base register 

16:SJ for system JSP»s 

175SD system data segment 

(dseg).PBS; Dseg Layout: MPL definitions for the fixed part of a 
dseg are given ooth as record declarations and as a set of 
displacements (compile-time constants) in <inps,processdefs,i:wn) 
which can be used as an INCLUDE file. 



Base of frame 

0: FakeExtLoc: ADDR SYSUFL(SD) to catch stack underflow 

1: FakeRetWrd: Z RetLoc pointer to RetLoc in this word 

2s FakeOldM: WORD fake oldF pointer 

3: FakeSysWord: WORD head of enable list, if any 

Ties to stack segment, code segment, this data segmnt itself, 
and owner's dsg 



U: SegNums: 

5: Bases: 

6: StackSeg: 
State 

7: Statepcs 

10: StateM: 

11: States: 



XWD dsegnum,csegnum 

ZWD dsegbase,csegbase absolute addresses 

ADDR stackbase segmented address 

Z pc value for process 

Z base of current stack frame 

Z top of stack 
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Call/Peturn machinery 

12: LocPtr: Z RetLoc used to provide not-in-memory traps 

13: P.etLoc: XCT ZSegmcore /"MOVE C, liases ; Bases 
absolute; 

Ifc': .' MOVS D,c load code base register 

15: JRST @-2{S) start at ExtLoc word in frame 

(sysdseg) .PBS j ,LBS=0j5ystem Dseg Transfer Vector Area 
: at the moment there is no mpl .INCLUDE file for these 
definitions; however, the vector is declared in mplrun, and a set 
of the declarations appears in nucleus. 



*.. .biggest index used so far « 156B * 

*return codeft 

sysrtnsliOB *simple returns* 

sysdrtn=12lt.B ^deallocation returns* 
*call trap cell* 

fnt= h,2B- 
*stack overf low*. 

sysovr= VxB ^overflow on frame allocation* 

sysov2s k6B ^overflow from within body of code% 

sysov3= SOB fcoverf low on AOBJP of local call* 
^allocation* 

xmakea S2B *make array* 

xmkstr- 5UB %make string^ 
^processes* 

create* S6B *create process* 

destroys 60B *destroy process* 

run* 62B *run process* 
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stops 6ii8 Sstop prbcesss 

pshenv= 66B Spush environments 

popenv= 70B Spop environments 
SportsS 

msgtrap* 723 

pendingfault= 7Ji3 

portcall* 76B 
SsignalsS 

SigslQOB 

sigportsl02B 

sigprocessslOliB 

err*l52B 

errport = 15liB 

errprocess=lS6B 

ctunw=l06B Scatch phrase unwind for exits 

ctunw*l06B Scatch phrase unwind for exits 

resume=HOB S"return" from signals 

propsi*sll2B Spropagate signals 
Sjoinss 

joinoothmiiB SJoin PORT and PORTS 

3oinproc=ll6B 
' ■ doinvar=l20B ■ '. 

jointo=122B 

bindsi26B 
S number of arguments checking S 

toofewargs=130B S caller supplied too few arguments % 

toomanyargs5l32BS caller supplied too many arguments S 
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unboundfnsi3i|B % calls on unresolved fn descriptors come 
through here %, 

undefinedsignal*136B % all signal variables are initially 
bound to this signal code %\ 

% binding and other faults associated with control % 

undefinedsignalsl36B % all signal variables are initially 
bound to this code % t 

segfaultslitOB % segment out of VM fault %, 

badportcall=lii2B % system called code for invalid port call 

controlfault=llikB % port call on a non-active process %, 

resolutionfault=li.6B % port call on an unconnected port %, 

portretry *150B % pc of a process which stopped because of 
a resolution or control fault points here for retry of the 
port call %} ' 

% fixed tables % 

xwdtabsllj.08 

argcheck=200B 

(functions) .PBS; MPS Function Calls 

Function returns, arguments, and local variaDles are stored on 
a stack. 

Two machine registers are reserved for addressing the stack, 
a tcp-of-stack pointer S and a frame pointer F. 

The stack, has the following appearance: 

S> /temporaries; 

/"local variables/ - K words 
M> old M 

return link 

saved return (inter-segment calls) 

/"arguments; - m words 
old S> 

The stack is associated with the currently executing 
process (docmos, processes, ; ) 

MPL declarations for a stack frame, both from the Doint of 
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view of the frame pointer ana from the point of view of a 
pointer to the saved return word are contained in the MPL 
INCLUDE file (mps,processdef s,l:wn) . 

The function call F(al, ..., aN) results in the following code: 

Same segment other segment 

PUSH S,al ; 

... ; --same: push arguments 

PUSH S aN 7 
AOBJP S,@SYS0V3(SD) XCT @3UBR(D) 
PUSHJ S,ENTRY(C) PUSHJ S,@SUBR+1(D) 

SUB S.XWDTAB+N J — same! delete arguments 

XWDTAS is global and in a fixed location. 

XWDTABfj/ contains XWD j+3*J+3. 

For inter-segment calls, the cells SUBR(DATA) and 
SUBP+KDATA) contain: 

ADDR RETLOC 
ADDR XENTRY(C) 

The entry (F) PROCEDURE (pi, ..., pM) compiles the following 
code; 

XENTRY: HRRZ A1,@0(S) 

XCT ARGCHECK-XWDTAB-M(Al) 

MOVS D,C 
ENTRY: PUSH S,M 

XCT FNT(SD) 

ADD S,XWDTAB+K-3 

JUMPGE S,@SYS0VR{SD) 

ARGCHECK is global and in a fixed location. 

ARGCHECK/"0; contains PUSH S,L0CPTR{D) . 

APGCHECK/"-j; contains JSP SJ,@T00FEWARGS. 

ARGCHECK/" j; contains JSP SJ,@T00MANYARGS. 

The register SD contains the address of the system data 
segment. . 

FNT(SD) contains MOVE M,S or a jump to the system if 
currently tracing procedure calls. . 

SYSOVR(SD) contains a jump to system code to handle stack 
overflow. 
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K is the size of the stack frame for F's local environment. 
The statement RETURN compiles the following instruction: 

JRST @SYSP.TN(SD) 
where SYSRTN(SD) points to RTNCOD 
RTNCOD: MOVE RS,S 

MOVE S,M 

MOVE M,0(M) 

JRST @-l(S) 

The register RS (currently = A2, see REGISTERS above) is used 
by the caller to retrieve the results. 

Returns with multiple results use the same system code. 

I 

The results are pushed in reverse order, 

RETURNlrl, r2, ... rn) results in 

PUSH s,rn 

PUSH S,r2 
PUSH S,rl 
JRST «3S¥SRTN{SD) 

On the calling side, x «■ p(: ml, m2, ... ran) results in 

SUB S,XWDTAB 
POP RS,X 
POP RS.ml 
, , • 
POP RS,mn 

If the lhs's would endanger the results on the stack (e.g. 
■Yf.Oi), then life becomes more complex. The code for this 
case is 

SUB S,RS 

MOVEM S,mrcell % rarcell is a compiler-allocated temporary 

% 

MOVE S,RS 
POP's using S 
ADD S,/"n,,n; 
ADD S,mrcell 
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This scheme protects the results by moving the stack pointer 
over them ana allows the caller to accept any initial segment 
of the multiple results, (I.e. if the routine returns k 
results and you only want the first 2, then you need only 
store the first two.) 

Multiwora scalars of length n are treated like n single word 
scalars. 

in addition, there are some global data structures associated 
with the function machinery. 

Each linkage (static data) segment contains a few words of 
descriptive information: 

BASES: XWD LBASE,C8ASE; Linkage and code base addresses 
LOCPTR: ADDR RETLOCJ Pointer to return routine 

RETLOC: XCT ZSeginVM jabsclute address of cell in global 
table 

MOVS D,C 

JRST @~2(S); Return through saved actual link 

There is a global table with one entry for each instance of a 
code-data pair: 

ZSeginVM is a two-word cell containing in its first word 
either 

MOVE c, BASES (BASES an absolute address), or 

JSP SJ,SSegFaulT/(SD) if the code segment associated 
with the data segment is not swapped in. 

The second word of each entry is used to save the BASES 
address when the JSP instruction occupies the first word, 
and the segment number of the associated code segment (or 
possibly a link chaining all the entries for one code 
segment together). 

Times for function calls using this scheme: 

Total local call/return is about 29 microseconds. 
Total external call/return is about kS microseconds. 

These times are for calls with no arguments. For n woras of 
args, add 3.9 * n for PUSH' ing before the call. 

(portcalls) .PBS; 

MPS port call machinery 
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Format of a port: 

ports ZVD port.oboectport % pointer to object port to 
which connected and to myself {absolute addresses) % 

msg: xwd 1*00000,0 % message buffer (initially contains 
the null message % 

dsegptr: ADDR dsegaddr % dsegaddr is the absolute address 
of the dseg in which the port resides % 

startup: JRST ^StatePC (LB) % used by portcail machinery to 
resume the process to which this port belongs % 

MPL declarations for ports are contained in the INCLUDE file 
(mps,processdefs,l:wn) . 

code for /"lhs W 'PORT '( portname f», message; /"'} ctchp/ •); 

In-line code: 

The in-line code produced depends on 

<a) whether the portname is local, and 

(b) whether the PORT call is in an expression. 

If the portname is not local, tnen the address of the port 
is calculated and saved, if it is local, tnen it can 
simply be addressed directly. 

If the PORT call is used in an expression, then control 

must reenter the process over the same port and there must 

be a non-null message waiting in the port or an appropriate 
trap is caused. 

The following is a detailed description of the in-line code 
produced. 

First, it may be necessary to save certain registers on the 
stack if the PORT call is part of a comDlex expression 
bein? evaluated. 

Message value is loaded into register Al. 

If the portname is local then 

Load register p with the connection word from the port. 

JSP SJ.^PORTCALHSD) 

If there is a catch phrase, produce code for it here. 
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Else (portname not local) 

Load address of port. 

Load register p with the connection word from the port. 

If the port call used in expression then push port 
address on stack so that can check later. 

Push D register so can restore later. 

This is necessary because may be doing PORT call for 
some other process, which means D not equal to LB. 
System code for port calls sets D to LB when starts up 
process, so when control comes back to this process 
will not have proper value in D. 

JSP SJ,@P0RTCALL(5D) 

If there is a catch phrase, produce code for it here. 

Pop stack to D register. 

If the PORT call used in expression, then POP stack to 
register so that can test later to ensure that control 
has returned over same. port. (Register RP is used for 
this currently) . 

If the PORT call value was to be loaded into some 
register other than Al, then move it. 

Now restore any registers which were saved initially. 

If the PORT call was not used in an expression, then done. 

Else, must check port and message. 

HRRZ p.p. to make P = address of port. 

If portname was local then CAIE P.portaddr 

Else came P,RP compare to address which was saved on 
stack, 

JSP SJ»*PENJ)INQFAULT(sb) to indicate that control did 
not return over same port that left. 

GAMN ac,NULMSG compare message to null message. 

JSP SJ.SMSGTRAP(SD) to indicate that port call returned 
null message. 

If there was a catch phrase, then produce (non -jumping) 
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JUMP instruction here which addresses the start of the 
catch phrase code. 

Finally, MOVEM NULMSG.PortMessageBuf fer to indicate that 
the message has been "removed" for use in the expression. 

code in MPLRUNTIME: 

portcallX: HLRZ A2,P % must ensure that port belongs to 
process in control % 

CAME LB,dsegptr(A2) % compare controlling dseg address and 
dseg addr to which subject port belongs % 

JUMPA BadPortcalKSD) % bomb out if not equal % 

MOVEM s, states (LB) save stack pointer of subject process 

MOVEM M,StateM(LB) save frame pointer 

MOVEM Sj.StatePC(LB) save program counter 

MOVSM P, (P) make object port connected to subject port 

HRRZ LB,dsegptr(p) begin startup of object process by 
getting base of his dseg. 

XCT RetLbc(LB) %move Bases into register C (and check for 
code segment not-in-memory) 

MOVE D, LB load D register (=LB contents) 

MOVE M,StateM(LB) load frame pointer 

MOVE S, States and stack pointer for object process 

JRST startup (P) fire up the object process 

(modules) .PBSj 

(code) MPS Object Code Format: MPL declarations for the blocks 
in an object module may be found in the INCLUDE file 
(mps,moauledef s,l:wn) . 

An MPS object code file consists of a series of blocks, each 
carrying length and type information. 

The first word of a block is Xw'D TYPE, LENGTH. 

The TYPE is one of the following codes: 

- empty space 
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1 - code 

2 -hash table 

3 - symbol data 
li - name table 

5 - structure table 

The LENGTH is the number of words in the block, including 
the header word. 

LENGTH = is an error, except that TYPE*l,ENGTH=0 
signifies the end of the file. 

The rest of the block (LENGTH-1 words) is interpreted 
according to TYPE. 

The first block in the file always contains the code for a 
module. 

The first few words of the code block contain information 
for the MPS loader. 

Word 1 contains a zero if this is a DATA module; 
otherwise 

Word l left half contains 

the displacement of the first instruction of the code 
segment after the initialization code (i.e. the 
procedures) , 

Word 1 right half contains 

the displacement of the first instruction of the 
process body {i.e. after the procedures). 

Word 2 left half contains 

the displacement of the first of the literals (i.e. the 
word following the actual code). 

Word 2 right half contains 

the size of the data segment. 
Word 3 rieht half contains 

hash index for module name. 
Word 1; is the entry for the initialization code. 
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This is a procedure of one argument (the number of 
words of arguments passed to the CREATE operation). 
These arguments have been stored into the first words 
of the data segment, if the count is incorrect then 
there will be a trap to the system like that for a 
procedure call with the incorrect number of arguments. 

The code itself follows. 

The code assumes that the CODE base register is loaded 
with the origin of the file, i.e. the address of the 
TYPE, LENGTH word of the code block. 

Unlike the situation in L10, all subsequent words in the 
code area are genuine instructions. 

Literals follow the code. 

Every module also has a hash table block, a block of semantic 
entries, a block of name strings, and a block giving 
. structural information relating the object code to the source 
statments. 

A type 2 block (hash table) has an extra word following the 
LENGTH word which contains in its left and right halves, 
respectively, the following! 

The left value, x, is the size in words of the index 
portion of the hash table which follows, 

The right value, h, is the number of hash table entries 
in the block beyond the index portiion of the table. 

These two values are required by any routines using 
symllO to access the tables because the routine settbl 
needs to be told x and h as its second and fourth 
arguments, respectively. 

Let m be the value in the rignt half of the 2B6 header; 
then m, x, and h are related as 

m * 2 + x + 2*h 

i.e., m includes the two header words, the number of 
index words, and the hash entries (which currently 
require two words each). 

All relative links between the hash table and the name and 
semantic blocks are relative to the first data word of the 
blockj this is word 1 in the semantic and name blocks, and 
word 2 in the hash table block. 

The format of semantic entries is described in 
(D0CMFS,D0CSYM,). 
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A type 5 (structure) block contains a sequence of words of 
the form XWD lc,, bytes. 

The lc indicates the displacement in the code corresponding 
to the start of a source language statement (SLS). 

The bytes are six bits wide and are used to encode the 
structural position of this SLS relative to its predecessor 
(or to the origin if this is the first SLS). 

The first byte is treated specially: 

BytesN, N IN [0,63] -- go up N levels in the structure, 
in other words, depth in the tree is decreased by N. 

All subsequent bytes are treated the same, namely: 

Byte=o — stop here. 

Byte=N, N IN tl»63] -- take Q=N MOD 16 successors, Q IN 
A, IS;, and then go down M=N/16 levels, M IN (0,3J. When 
go Q successors, increment position at current deptn by 
Q. When go down M levels, M > 0, increment depth by Q . 
and set the position at each new level to 1, 

As many words are used as necessary to encode the new 
position; the lc is simply the same in each worq. Most 
common relationships can be encoded in a single word. 

(symbols) Context usage in MPL(A): MPL declarations for context 
usage in object modules, and symbol type declarations can oe 
found in the INCLUDE file (mps,modulesymdefs,l:wn) . .PBSj 



CTX pdctx 

For module names, procedure descriptors, and directory link 
strings, 

CTX xlctx 

For first word of external or forward procedure calls -- 
not outout. 

CTX x2ctx 

For second word of external or forward procedure calls ■--■■ 
not output. 

CTX lsctx 

For literal strings. 
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CTX metctx 

For tokens and special variables used with MPLMETA. 

CTX mpictx' 

For special use in compiler -- not output. 
CTX prctx 

For outermost scope of module. 

Currently pdctxso, xlctx s l, x2ctx*2, lsctx^, metctx=lj., 
mplctx=> f prctx=6. 

The higher numbered contexts are used for 

INCLUDE «d modules 

procedures 

Format of semantic data entries; MPL declarations for the 
symbol type field and its possible values can be found in the 
INCLUDE file (mps.modulesyradef s,l: wn) , 

in first word of entry 

attributes — 13 bits at position 19 
from right to left in the attribute field 

defned -- defined id 

linked -- referenced but not yet defined 

const -- compile-time constant 

noout --do not output to object file 

typeiS; -- type of the id (see below) 
word 2 of se holds system info 
word 3 of se holds *v 

word k of se is used for constval and dirlink 
word 5 of se is used for nwords and contx 
word 6 of se is used for numarg 



MPS Runtime Reference Manual MPS 10.0 

Mitchell 11 AUG 72 

SRI/XPARC PAGE 16 

(symtypes) Types of entries 

UNDEF=0 undefined 

Semantic entries are initialized to zero, so type is 
automatically UNDEF when entry created, 

PROCUsl local procedure 

Under prctx 

#V is external entry point 

(#V+3 is internal entry point) 

§contx holds the context number for local declarations 

©numarg - number of WORDS of arguments 

If has entry under pdctx 

*V is location of procedure descriptor in dseg 

This should be initialized by CREATE to hold actual 
descriptor. 

May have had entries under xlctx and x2ctx if there were 
calls made on the procedure before it was defined. These 
entries used to fixup such calls, then are "deleted" (not 
output) . 

LAB«2 label 

*V is location of first instruction of the statement 
following the label. 

MWS*3 multiword scalar 

*V is loc of first word 

©nwords a number of words 
P0RT = lj. ports 

♦V is loc of first word 

tnwords a number of words 
SIGC0D=5 signal code 

*V is loc 
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CREATE should initialize this cell to hold its own 
segmented address. All sigcod's are in the dseg, even if 
declared in a procedure. 

©nwords = number of words 

VARs6 normal identifier 

if @s const then tnis is constant and value is in ©constval 

else *V sives loc of variable 

Under metctx -- a special variable for compiler. 

#V gives location in dseg. 

Name indicates which variable. 

'nexttoken' -- pointer to next input token (for META) . 

'outword' -- pointer to word holding next output (for 
OUT). 

♦outline' --pointer to string holding next output (for 
SOUT). 

FIELD-7 id's declared as FIELD or in RECORD 

if sas const then this is a constant field and ©constval 
holds the field descriptor (byte pointer) 

else is variable field and »V gives the loc 

M0DUL3=S names of modules 

Under pdctx 

if #V # then this module was INCLUDE 'd and *V * context 
number for declarations in that module 

else this module name was not INCLUDE <d but has been used 
in CREATE or other such statement 

if ©dirlink # then module was iNCLUDE'd and Sdirlink * 
hash of string used to access the module. 

If the link contained (dir,file, junk) , tnen the string 
will be <dir>file followed by a zero character. 

XPR0C89 external procedure 

Under pdctx 
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if ©dirlink # then this name was listed in DIRECTORY 
and eairlinie s hash of string containing link from 
directory 

The string actually contains everything that was 
written between the parentheses of the link. 

@numarg = number of WORDS of args assumed (or -l if no 
calls actually made) 

#V 5 loc of first word of procedure descriptor in dseg. 

CREATE should initialize descriptor to trap if used 
before bound. 

Entries under xictx and x2ctx used td fixup calls but not 
output. 

ARRAYslo statically allocated arrays 

*v contains addr of first word of array 

REC0RD=11 for id used as name of record 

©nwords contains RECORDSIZE (i.e. the number of words 
needed to hold an instance of this record).. 

PR0CV=12 variables declared to be procedure 

*V is loc of first word of descriptor 

Where possible (i.e. in dseg) these should be initialized 
to trap. 

©nvords is size of descriptor 

STRlNGal3 

*V is loc of pointer to descriptor 

Under Isctx 

literal string — initialized by CREATE 

Under other contexts -- string variables 
ARRAYV=1U array variable (holds pointer to array) 
REGISTEH-15 fixed location scalar 
SIGVAR=16 signal variable (holds signal code) 
UXPR0C=17 unreferenced external procedure — not output 
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N0DENAME=18 symbol used as tree name in MPLMETA construct 
Under metctx. 

*V gives location in dseg for name table index when 
initialize the parser, 

T0KEN«19 symbol used as token in MPLMETA construct 

Under metctx. 

*V gives location in dseg for token when initialize the 
parser. 

Use of father-son linking capabilities 

The se for the name of the module being compiled is the root 
of the tree. 

The hash index for the module name is stored in the object 
file as described elsewhere in this file. 

Immediate sons of this se are 

1) outref XPROC's and literal strings 

2) module arguments 

3) directory names 

if directory entry is an INCLUDE, then tree for the 
included declarations is under the se for the name of the 
module. 

U variables declared in the module 

5) procedure names 

6) labels in the body of the module 
Sons of the procedure name se are 

1) formal parameters 

2) local declared id's 

3) local labels 

Records are structured as tree under the se for the record 
. name. 
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The tree structure reflects the structure used in the 
record declaration. 

(signals) .PBS; 

Significant features 

continued propagation through dynamic scope until signal 
terminated. 

Pass message as well as signal code. 
Return value as result of signal. 
Special UNWIND signal to allow cleanup. 

Signal frame 

A signal frame contains 

a flag set in the return word indicating a signal frame 

pointer to the frame whose catch phrases are being executed 

pointer to the process which originated the signal 

global name of signal code (segmented address) 

signal message 

a flag indicating whether RESUME is legal for this signal 

The signal frame is always on the stack of the process whose 
catch phrases are being executed. 

Signal propagation and termination 

A signal is said to be terminated when some catch statement 
executes either 

a resume, or 

a branch out of the catch phrase (such as RETURN or EXIT). 

In the first case 

the signal frame is deleted 
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If 4 RESUME is legal, then 

the process which originated the signal is given control 

Else a new SIGNAL is generated indicating attempt to resume 
after "unresumable" signal. 

in the other cases 

the stack of the process terminating the signal must be 
unwound back to the point where the signal is being 
terminated. 

Before a frame is deleted* a special signal (UNWIND) is 
given so that any necessary housekeeping can be performed. 

If a signal is not terminated within a process, then it is 
propagated to the creator of that process. The signal frame 
is deleted from one stack and recreated on the other. 

RESUME statement 

RESUME can return a value and thus any signal (SIGNAL, SIGNAL 
• PORT, or SIGNAL PROCESS) can be used in an expression. 

If RESUME does not explicitly specify a value, then the 
special value NULLMESSAGE is usee, if a signal is used in an 
expression and gets back a NULLMESSAGE, then a MESSAGEFAULT 
trap is caused. 

The same trap occurs when a PORT is usea in an expression 
and fails to return a value and when a EMPTY specifies a 
port containing NULLMESSAGE, 

It is possible to attach a catch phrase to the signal to 
handle this trap or others. 

It will often be the case that the signalling program is not 
willing to be resumed. To make this explicit and to allow 
the system to intercept illegal attempts to resume, 'ERROR 
may be used in place of 'SIGNAL. Thus ERROR, ERROR PORT, and 
■'■ ERROR PROCESS ail behave like the corresponding SIGNAL 
statements, except control can not return be means of a 
RESUME. 

Scope of catch phrase and propogatlon of signals 

A catch Phrase covering a statement list may occur at the 
beginning of 
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the program body, 

a procedure body, or 

a block. 

Such catch phrases are enabled throughout the execution of 
the "statement list. They are automatically disabled when the 
statement list is completed or left by a branch statement. 

A catch phrase which covers a function call or port call is 
enabled when control leaves and is disabled when control 
returns. 

A catch phrase is never enabled when it is being executed. 
In other words, the scope of the catch phrase does not 
include the catch phrase itself. 

implementation 

A system routine is called to propagate signals. 

This routine "calls" the innermost enabled catch phrase with 
the signal code, signal message, and pointer to original 
frame as arguments. 

The catch phrase is compiled to use the pointer to access 
local variables of the origial frame. 

Control returns to the system routine if the signal is to be 
propagated beyond this frame. 

Thus the Piain problem for the system routine is to determine 
the entry point of the innermost enabled catch phrase. 

If control left the frame by a function or port call, then a 
catch phrase associated with that call would necessarily be 
the innermost one. 

The reactivation location for the frame (stored as the 
process PC or the return location of the frame above it on 
the stack) is used to look for the presence of a catch pnrase 
associated with the call. 

in the Pi)P10 implementation, catch phrases always begin 
with a jumpa instruction which branches around the body of 
the catch phrase. The system routine looks for the 
presence of this instruction following at the reactivation 
location to determine whether there is a catch phrase with 
the call. 
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If there is no catch nhrase with .the call, then the innermost 
enabled catch phrase is associated with a statement list 
containing the reactivation location. 

To help the system routine find the entry point in this case, 
all frames contain a pointer to the innermost enabled catch 
phrase wnich is associated with a statement list. 

If the pointer is aero, then there are no enabled statement 
list catch phrases. 

A flag in the return word (which is automatically zeroed on 
calls) indicates whether this pointer has been set. If the 
flag is zero, then there are no enabled statement list 
catch phrases. 

If the signal is not terminated oy the catch phrase, then it 
must be propagated. If this is the last (i.e. outermost) 
enabled catch phrase associated with the frame then control 
is simply returned to the system routine, otherwise, control 
is transfered to the innermost enclosing enabled catch phrase 
(which can be determined at compile time since catch phrase 
scope is lexical). 



Signal codes and signal variables 



The signal codes name the signal. They occur as the first 
argument of the signal statement and at the head of catch 
phrase cases. 

There are system defined codes (UNWIND, PQRTFAULT, etc.) and 
user defined codes. 

User defined codes are simply identifiers. There is no 
special declaration for such codes. An identifier used as a 
signal cannot be usea in other capacities within that 
context. 

Since signals can be passed between .processes, it must be 
possible to indicate that a signal code -in one process is to 
be the same as another signal code in another process. This 
is simply a name binding problem and is handled by the usual 
machinery (i.e. signal codes are pound by the same 
mechanisms that are used to bind external procedure names) . 

There are SIGNAL variables which can hold signal codes. 
(SIGNAL variables are to signal codes as PROCEDURE variables 
are to external procedure references) . 
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Misc 

in those cases where a trap can be caused after control is 
returned, special means are required to determine if there 
was a catch phrase associated with the call. 

Examples 

PENDINGFAULT — control did not come Pack thru the same 
port 

MESSAGEFAULT -- expecting message Put did not get one 

in the PDP10 implementation, these traps are initiated by 
inline code following the call (and following the catch 
phrase if there is one). 

The trap is actually a call to a system routine which 
generates an appropriate signal. If there is a catch 
phrase associated with the original call, then it must be 
given a chance to catch this signal. Since the pointer to 
the call is no longer available, it is instead stored by 
the compiler as the address of a JUMP instruction (which is 
actually a NOP) following the system call to produce the 
trap. If there is no JUMP instruction following the system 
call, .then tnere is no catch phrase with the original call. 

Since a test for PENDINGFAULT is always followed by a test 
for MESSAQEFAULT.., there is only one JUMP instruction 
produced which is "shared" by these two. The PENDINGFAULT 
system routine Knows to look past the inline code for the 
MESSAGEFAULT test for the JUMP instruction. 

(strings) ,P3S;- 

This section describes a string system for MPS which we will 
implement in order to get MPS off the grouna. 

We intend that it be reolaced by something closer to the 
proposal in (DOCSTR, ) at some future date. 

Language syntax and semantics 

A variable of type STRING is meant to hold a pointer to a 
string descriptor. 

String descriptors are allocated from a "heap", either 
automatically or by system functions accessible to the 
programmer. 

A STRING variable gets a descriptor allocated for it on 
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procedure entry (or process creation), and deallocated on 
procedure exit (process destruction) . 

A dimensioned string, like a dimensioned array, gets 
its body allocated in the same way. 

The automatic allocator actually associates with each 
frame or process a list of the storage allocated for it, 
so the right thin? happens even if a strxng variable is 
subsequently used to hold a pointer to a user-allocated 
descriptor. 

Types of string descriptors: 

1) Explicit-string descriptor 
points to block of characters 
Fields 

Front 

first character of text block which is contained in 
this string 

End 

first character of text block which follows this 
string 

Maxend 

maximum value for End before overflow this text 
block 

Pointer 

address of text block 

Ident 

This field is available to hold program-specific 
information. It can be written and read by user 
programs and is intended to hold information which 
will help x,he program identify the string. 

2) Implicit-string descriptor 

Fields 

ReadFunction 
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Descriptor of function used to read characters in 
the string. 

WriteFunction 

Descriptor of function used to write characters in 
the string. 

LengthFunction 

Descriptor of function used to find and set the 
length of the string. 

Ident 

Same as above, 
on a read access, the system returns the result of 

ReadFunction {String, Position). 
On a write access to the string, the system calls 

WriteFunction (String, Position, Char). 

When the length of the string is requested, the system 
returns 

LengthFunction (String, 0). 

To set the length of a string, the system performs 

LengthFunction(string, 1, Nchars). 

The LengthFunction is intended as a catch-all for which 
additional uses may be found in the future. 

There is no special syntax associated with strings. 

Assignment for strings is defined as simply copying the 
pointer to the descriptor. 

Mention of a string variable refers to the pointer. 

Special action is taken for literal strings. 

For the moment, literal strings may appear only in the 
program, not as initialization or parameter values. 

Functions to be provided 

Where a string is listed as an argument, a pointer to a 
string descriptor is actually required. 
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Functions for setting the fields of string descriptors 

MakeStroesc (String, Front, End, Maxend, Pointer) 

The descriptor pointed to by String is made an 
explicit-string descriptor with fields set to the values 
passed for the other arguments. 

(The Ident field is unchanged by this operation. It is 
initialized to zero when the descriptor is created.) 

The primary use of this function will be to make a 
descriptor which points to a text body in a private 
storage area. 

(I'm afraid this will require knowledge of how characters 
are counted in text blocks.) 

MakelmpstrDesc (String, ReadFcn, WriteFcn, LengthFcn) 

The descriptor pointed to by String is made a 
implicit-string descriptor with fields set to the values 
passed for the other arguments. 

(Again, the ident field is unchanged.) 

SetDescldent(String, ident) 

The Ident field of the descriptor is set to the given 
value. 

NumWords <- WordsForBody (Nchars) 

Returns count of how many words will be required to hold 
the specified number of characters. 

Functions for getting info from string descriptors 

From any string descriptor 

STP,Type( String) 

STRIdent (String) 

STRLast( String) 

Returns the index of the last character in the string. 

STRLength (String) 

Returns the current length of the string. 

The values of sTRIast and STRLength are calculated from 
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the values of the Front and End fields for explict type 
strings and from the value of LengthFunction for implicit 
type strings. 

The following functions simply provide access to various 
fields of the descriptors. 

From explicit-string descriptors 

STRFront (string) 

STREnd (String) 

STRMaxend (String) 

STRPointer( string) 
From implicit-string descriptors 

STRReadFcn (string) 

STRWriteFcn (String) 

STRLengthFcn ( string ) 

Functions implementing language features 

The following functions are needed to implement string 
features which will someday be added to the language. 

Functions for accessing characters 

These two functions are used to implement string/'exp/ as 
a left hand side in MPL. 

NthChar(Strinsr, Position) 

loads the character from the specified position. If 
position is not witnin the pounds of the String a 
special value EOS (End Of String) is returned, 

SetNthChar(String, position, Char) 

writes the character at the specified position. If 
position is beyond the end of string, an error is 
generated. 

Functions for string construction 

SetStrNull (String) 

Resets the string, i.e. sets End=Front. 
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Setstriength (String, Nchars) 

Sets the length of the string to Nchars, which must lie 
in /"0, Maxend-Front;. 

AppendString (To, From) 

AppendChar(To, Char) 

AppendSubstring (To, From, First, Last) 

If First<0, First is taken as 0; if Last>STRLast (To) , 
Last is taken as STRLast(to). 

AppendBlanks(To, Count) 

Functions for creating and destroying strings 

The lifetime of a declared string is limited to the 
lifetime of the scope in which it is declared. In other 
words, when a procedure returns or a process is destroyed 
all strings which were declared in that procedure or 
process are automatically deleted. 

The following functions provide for the creation of strings 
whose lifetime is explicitly controlled oy the programmer. 

refstring «• MakestringO 

The function MakeString returns a pointer to a descriptor 
for a null string. 

Release$tring (refstring) 

The referenced string is deleted. 

Functions for general storage allocation 

There are also some procedures for allocating and 
deallocating storage on the heap, whose use is not limited 
to strings. 

ref block «■ MakeBlock(n) 

Returns a pointer to a newly created block of n words. 
The block is guaranteed to contain only zeros. 

Releaseaiock (ref block) 

Deletes the referenced block. As usual, the programmer 
is resconsible for ensuring that no pointers to the block 

. . remain. 
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SplitBlock(ref block, n) 

The referenced block is split. After the operation, 
ref block refers to an n-word block consisting of the 
first n words of the old block; the rest of the old block 
is deleted. If n is greater than the length of the 
block, an error occurs. 

BlockSize (ref block) 

Returns the length of the block in words. 

implementation 

The RH of the system word of a procedure frame is the head of 
a chain of automatically allocated blocks for that procedure. 

This includes string descriptors and string and array 
bodies. 

The allocator chains all the blocks it creates through a 
pointer in /"-l, 16:0/ of the block. 

/"-1, 17:18/ is used for the block length and Y-l,l:35/ for 
a free flag. 

There is a bit in the return word in a frame which is cleared 
by PUSHJ and set when the first block is allocated. 

The system and return words in the fake frame in a dseg are 
used in the same way for the automatic storage for the 
process. 

The compiler generates a different return instruction for a 
procedure return if it is possible that automatic storage hs 
been allocated. 

The deallocator must check the bit, since in the future it 
may be possible to have automatic storage which does not 
show up explicitly in the declarations. 

The DESTROY procedure must check the bit for process 
storage. 

For the moment, the literal strings appearing in a program 
are collected together under a new reserved context in the 
symbol table. 

The context number is 3; the "name" is the text of the 
literal; the "value" is the dseg location for the pointer 
to the descriotor. 

When a process is created from the program, the CREATE 
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procedure is responsible for allocating descriptors and 
bodies for the strings, copying their text (which appears in 
the symbol table) into the body area, and setting up pointers 
in the dseg. 

The storage allocated in this way persists for the lifetime 
of the process just like static string variables. 

(fsp) .PBSj 

The free storage package provides a simplified zone type 
storage allocation system. 

The following procedures are found in (MPS, FSP,). 
MakeZone (zone, size) 

The "size" words of storage starting at location given by the 
(virtual memory) address "zone" is initialised to be a 
storage zone. After this has been done the zone can be used 
as a parameter of the following procedures. 

Links used by the free storage package are all maintained 
relative to the starting location of the zone. Thus the zone 
may be relocated without disturbing its use. 

Generates the following signal: 

BadZoneSize --on FDPlO size must be between 6 and 100O00B 
words. 

node ■«- MakeNode( zone, size); 

Returns (virtual memory) address of node of "size" words of 
user storage in the specified zone. 

Generates the following signals: 

BadKodeSize -- size <» 0, 

NoRooralnZone --cannot find space for node of size words. 

NodeSize (zone, node) ; 

Return the size in words of the node. 

FreeNode (zone, node) j 

Release the storage occupied by the node. 

SplitNodeUone, node, size) ; 
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Split the specified node into two sections -- the first of 
which is a node of size words, the second of which is freed, 

(names) .PBS; 

This section describes how processes, as well as variables and 
procedures in then and the modules from which they were created 
are named and accessed. 

The following rules and their consequences apply: 

(a) The context in which a system routine is called, along 
with complete Qualification within that context are both 
required to name a process. 

Process naming; syntax: 

processname ::* /'processidi $(', processid)j 

processid : := . 

If the optional, leading processid is not present (i.e., if 
the processname begins with a ".") then the context within 
which the name match is to be performed is the root of the 
segment naming tree, otherwise, the context within which 
the name is to be matched is the context of the process 
whose dseg address is in the D register ("whose static data 
is current" as opposed to "which is in control" - the Id 
register is the address of the dseg of tne process which is 
in control) . 

(b) Given the segment number of a dseg (possibly a stack 
segment) - which may be obtained by coercing a processname, 
an object in the dseg can be referred to by the syntax 

objectname ;:a processname $('., ,ID)j 

Since a processname must be completely qualified, there is no 
ambiguity in the meaning of the .ID'S. 

Module and process names: 

A complete TENEX file name is a module name. A simple 
identifier used as a module name must be bound to a TENEX 
file name. The mechanisms for accomplishing this are 

(a) at compile time: 

the directory of a MPL program contains an entry such as 

(ModuleName) (directory, file) 

Then any use of ModuleName is equivalent to using the 
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TENEX file <directory>f lie with the normal TENEX 
conventions on file names and completion of incomplete 
file names. 

(b) at execution time: 

The run-time segment space is accessed to determine if 
there exists a process whose name is ModuieName which is 
accessible by the conventions stated below for segment 
names, if such a process exists, a match has been found. 
Otherwise, a stack of directory files is searched to map 
the module name to a file name. A directory file is a 
sequential text file with entries of tne form 

'{ .ID ») ' (directory', file f',nane/M 

on separate lines (anything at all can follow the second 
right parenthesis). 

A directory file is searched from line to the last line 
of the file, in order. If no match is found in the most 
current directory, the next most current directory is 
searched until the directory stack is exhausted, at which 
time a signal (UndefinedModule) is generated. 

Files can be pushed onto the directory stack or removed 
py the MPL procedures NewDirectory (STRING filename) and 
RemoveDirectory (STRING filename). If the argument to 
RemoveDirectory is the null string, the directory file on 
the top of the directory stack is removed; othewise the 
named file is deleted from the directory stack, if in it. 
NewDirectory puts the name of the file on the toD of the 
directory stacK after first removing any occurrence of 
the same name in the stack. 

Binding proedure Names 

An external procedure p which is not declared as a procedure 
variable and is used in a process x is bound at run-time when 
it is first called from within X. This can be overridden by 
an explicit BIND statement at any time or by a call on the 
system-supolied procedure Sindprocedures (dsegname) , 

If there was an entry for p in X's compile-tlme directory 
then p will be bound to the procedure in the declared file, 
if an instance of it exists. If no instance of that file 
exists, the signal ModuleNotcreated is generated. If p had 
no directory entry in X, then the system will attempt to bind 
p by finding an instance of a procedure wth the same name. 
The name is sought according to the following algorithm: 

(NameSearch) : 
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(a) X's sibling processes are searched for an object named 
p of the same type as p in X. If exactly one such is 
found, X.P is bound to it; if more than one such is found, 
a signal, AmbiguousName is generated. 

(b) if no match for p is found among X's siblings, then X's 
parent is searched for a match; if none is found, X's 
grandparent is searched, etc. If the root of the segment 
tree is reached without a match, then the signal 
ResolutionFauit is generated. 

Whenever an unbound procedure variable is called, the 
ResolutionFauit signal is generated so that the binding nay 
be done by any program willing to catch the signal. 

(bootstrap) .P3S; 

This section gives an overview of the bootstrapping process by 
which the MPS comes into existence, both initially, and later 
when MPS exists to bootstrap itself. 

The non-MPS beginning 

An L10 program (on the PDP-IO) maps into the "bottom" of 
memory the following files? 

MPLNUCIEUS 

this program will be given control after the L10 program 
has finished. It completes the .iob of creating the MPS 
environment, it is described more fully in the next 
section. 

MPLRUNTIMS 

this program contains the run-time support code and 
system transfer vector for MPS, The NUCLEUS will help it 
to set up the environment. 

SEGRUN 

This is the segmentation machinery. MPLNUCIEUS will pass 
on information given it about the whereabouts of 
MPLNUCIEUS, MPLRUNTIME, and SEGRUN so that SEGRUN can 
initialize the segmet tables correctly. 

The 110 program then allocates space at the "top" of memory 
for data segments for each of the three above MPL programs. 
The addresses of each of the programs and their data segments 
are placed in a fixed place in the MPLNUCLEUS dseg. 

A stack segment is also allocated below the data segments at 
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the top of memory. It belongs to MPLNUCLEUS and its address 
will be placed in the standard place in MPLNUCLEUS' dseg. 

The MPS bootstrapping NUCLEUS 

Control is given to the MPLRUNTIME "process" so that it 
can initialize the system transfer vector, the SjD register, 
and the AKGCHaCK and XWDTAB vectors. 

All procedure descriptors should either be bound by the 
NUCLEUS (which means that it nust know all the uses of SEGRUN 
from within MPLRUNTIME and vice versa) or should cause a trap 
which the NUCLEUS will translate into a call on the BIND 
routines in MPLRUNTIME (in this case, the NUCLEUS only needs 
to Know about a few procedure descriptors; all others will 
be bound as they are used). 

initializing the MPL runtime package 

Initializes the system vector, the ARGCHECK and XWDTAB 
vectors. 

Initializing the segmentation machinery (SEGRUN) 

SEGRUN can use almost all the normal MPLRUNTIME facilities to 
initialize itself. It is passea the address of a table of 
pairs of addresses (fileseg, dataseg) where, if filesegsO, it 
is ignored, and the associated dataseg address is taken to be 
the address of a stack segment, and if filesegs-i, the entry 
marks the end of the table. 

Creating and starting the MPS debugger (the first true MPL 
process) 

once the MPL environment has been established, the NUCLEUS 
CREATES the MPS DEBUGGER, using all the normal MPL 
facilities. 



